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Abstract 

We consider general stochastic systems of interacting particles with noise which 
are relevant as models for the collective behavior of animals, and rigorously prove 
that in the mean-field limit the system is close to the solution of a kinetic PDE. 
Our aim is to include models widely studied in the literature such as the Cucker- 
Smale model, adding noise to the behavior of individuals. The difficulty, as compared 
to the classical case of globally Lipschitz potentials, is that in several models the 
interaction potential between particles is only locally Lipschitz, the local Lipschitz 
constant growing to infinity with the size of the region considered. With this in mind, 
we present an extension of the classical theory for globally Lipschitz interactions, 
which works for only locally Lipschitz ones. 
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1 Introduction 

The formation of large-scale structures (patterns) without the need of leadership (self- 
organization) is one of the most interesting and not completely understood aspect in the 
collective behavior of certain animals, such as birds, fish or insects. This phenomena has 
attracted lots of attention in the scientific community, see IE1Q31E31G71 and the references 
therein. 
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Most of the proposed models in the literature are based on particle-like description of 
a set of large individuals; these models are called Individual-Based Models (IBM). IBMs 
typically include several interactions between individuals depending on the species, the 
precise mechanism of interaction of the animals and their particular biological environ- 
ment. However, most of these IBMs include at least three basic effects: a short-range re- 
pulsion, a long-range attraction and a "mimicking" behavior for individuals encountered 
in certain spatial regions. This so-called three-zone model was first used for describing 
fish schools in [|2l|25| becoming a cornerstone of swarming modelling, see [|3ll24l. 

The behavior of a large system of individuals can be studied through mesoscopic 
descriptions of the system based on the evolution of the probability density of finding 
individuals in phase space. These descriptions are usually expressed in terms of space- 
inhomogeneous kinetic PDEs and the scaling limit of the interacting particle system to 
analyze is usually called the mean-field limit. These kinetic equations are useful in bridg- 
ing the gap between a microscopic description in terms of IBMs and macroscopic or hy- 
drodynamic descriptions for the particle probability density. We refer to the review [fT2l 
for the different connections between these models and for a larger set of references. 

The mean-field limit of deterministic interacting particle systems is a classical question 
in kinetic theory, and was treated in [[61 [181 EH in the case of the Vlasov equation. In 
these papers, the particle pairwise interaction is given by a globally bounded Lipschitz 
force field. Some of the recent models of swarming introduced in |fT9l [T6l 1221 do not 
belong to this class due to their growth at infinity leading to an interaction kernel which 
is only locally Lipschitz. These IBMs are kinetic models in essence since the interactions 
between individuals are at the level of the velocity variable to "align" their movements for 
instance or to impose a limiting "cruising speed". The mean-field limit for deterministic 
particle systems for some models of collective behavior with locally Lipschitz interactions 
was recently analysed in [9] showing that they follow the expected Vlasov-like kinetic 
equations. 

On the other hand, noise at the level of the IBMs is an important issue since we cannot 
expect animals to react in a completely deterministic way. Therefore, including noise 
in these IBMs and thus, at the level of the kinetic equation is an important modelling 
ingredient. This stochastic mean-field limit formally leads to kinetic Fokker-Planck like 
equations for second order models as already pointed out in IfTOll . The rigorous proof of 
this stochastic mean-field limit has been carried out for globally Lipschitz interactions in 
11341 13011, see also ||29l. 

This work is devoted to the rigorous analysis of the stochastic mean-field limit of lo- 
cally Lipschitz interactions that include relevant swarming models in the literature such 
as those in |[P9l [T6l . We will be concerned with searching the rate of convergence, as 
the number of particles N — > oo, of the distribution of each of the particles and of the 
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empirical measure of the particle system to the solution of the kinetic equation. This con- 
vergence will also establish the propagation of chaos as iV — > oo for the particle system 
and will be measured in terms of distances between probability measures. Here, we will 
not deal with uniform in time estimates since no stabilizing behaviour can be expected in 
this generality, such estimates were obtained only in a specific instance of Vlasov-Fokker- 
Planck equation, see (5|. The main price to pay to include possible growth at infinity of 
the Lipschitz constants of the interaction fields will be at the level of moment control es- 
timates. Then, there will be a trade-off between the requirements on the interaction and 
the decay at infinity of the laws of the processes at the initial time. 

The work is organized as follows: in the next two subsections we will make a pre- 
cise descriptions of the main results of this work, given in Theorems 11.11 and 11.21 below, 
together with a small overview of preliminary classical well-known facts and a list of 
examples, variants and particular cases of applications in swarming models. The second 
section includes the proof of the stochastic mean-field limit of locally Lipschitz interact- 
ing particle systems under certain moment control assumptions (thus proving Theorem 
11.11) . Finally, the third section will be devoted to the proof of Theorem II .2[ a result of 
existence and uniqueness of the nonlinear partial differential equation and its associated 
nonlinear stochastic differential equation, for which the stochastic mean-field limit result 
can be applied. The argument will be performed in the natural space of probability mea- 
sures by an extension to our diffusion setting of classical characteristics arguments for 
transport equations. 

1.1 Main results 

We will start by introducing the two instances of IBMs that triggered this research. The 
IBM proposed in [fT9ll includes an effective pairwise potential U : R d — > R modeling 
the short-range repulsion and long-range attraction. The only "mimicking" interaction in 
this model is encoded in a relaxation term for the velocity arising as the equilibrium speed 
from the competing effects of self-propulsion and friction of the individuals. We will refer 
to it as the D'Orsogna et al model in the rest. More precisely, this IBM for iV-particles in 
the mean-field limit scaling reads as: 

V\ 

(a-/3 \Vf)V* ~^J2 W d X * - X 'D> 

where a > measures the self-propulsion strength of individuals, whereas the term cor- 
responding to > is the friction assumed to follow Rayleigh's law. A typical choice for 



' dX l 
~dT 

~~dT 
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U is a smooth radial potential given by 

U(x) = -C A e~ N2/ ^ + C R e-^ 2/ ^. 

where Ca, Cr and I a, £r are the strengths and the typical lengths of attraction and repul- 
sion, respectively. 

The other motivating example introduced in [16] only includes an "alignment" or re- 
orientation interaction effect and we will refer to it as the Cucker-Smale model. Each 
individual in the group adjust their relative velocity by averaging with all the others. This 
averaging is weighted in such a way that closer individuals have more influence than fur- 
ther ones. For a system with N individuals the Cucker-Smale model in the mean-field 
scaling reads as 

N 



dt N 

with the communication rate matrix given by: 



w ij = w{\X i -Xt\) 



\X l - X3 



2^7 



for some 7 > 0. We refer to lfT6l l22l ITTL [T2l| and references therein for further discussion 
about this model and qualitative properties. Let us remark that both can be considered 
particular instances of a general IBM of the form 

dt 

UV* 1 A ... (i- 1 ) 

— = -F(X\ V l )-—J2 H ( X * ~X\V l - V J )dt, 1 < % < N 

i=i 

where F, H : R 2d — > R are suitable functions: the D'Orsogna et al model with F(x, v) = 
(f3\v\ 2 — a)v and H(x,v) = V x U(x) and the Cucker-Smale model with F = and 
H(x,v) = w(x)v. Let us emphasize that F in the D'Orsogna et al model and H in the 
Cucker-Smale model are not globally Lipschitz functions in R 2d . 

Our aim is to deal with a general system of interacting particles of the type ( ll.lt with 
added noise and suitable hypotheses on F and H including our motivating examples. 
More precisely, we will work then with a general large system of iV interacting R 2d -valued 
processes (JQ, V t l ) t >o with 1 < i < N solution of 



dX\ = V t l dt, 



1 N (12) 

dv? = V2dBt - F(xi, v*)dt - ^J2 H ( x t - x ?> v t - V t) dt > 
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with independent and commonly distributed initial data (Xq, Vq) with 1 < i < N. Here, 
and throughout this paper, the (B % t ) t >o are N independent standard Brownian motions 
in R d . More general diffusion coefficients will be considered in the next subsection. The 
asymptotic behavior of the Cucker-Smale system with added noise has been recently con- 
sidered in [fT51 . and eq. (11.21) includes as a particular case the continuous-time models 
discussed there. Our main objective will be to study the large-particle number limit in 
their mean-field limit scaling. It is sometimes usual to write (Xl' N , V^' N ) to track iV in- 
dividuals, but to avoid a cumbersome notation we will drop the superscript N unless the 
dependence on it needs to be emphasized. 

By symmetry of the initial configuration and of the evolution, all particles have the 
same distribution on R 2d at time t, which will be denoted / t . For any given t > the 
particles get correlated due to the nonlocal term 

1 - 

-jfY< H ( x i- x ii v i- v t) 

3=1 

in the evolution, though they are independent at initial time. But, since the pairwise action 
of two particles i and j is of order 1/N, it seems reasonable that two of these interacting 
particles (or a fixed number k of them) become less and less correlated as N gets large: 
this is what is called propagation of chaos. The statistical quantities of the system are 
given by the empirical measure 

1 N 

i=i 

It is a general fact, see Sznitman [|34ll , that propagation of chaos for a symmetric system 
of interacting particles is equivalent to the convergence in N of their empirical measure. 
Following [|34l we shall prove quantitative versions of these equivalent results. 

We shall show that our N interacting processes (JQ, V t l ) t >o respectively behave as 
N — > oo like the processes (X], V\) t >s, solutions of the kinetic McKean-Vlasov type 
equation on \R 2d 

dT t = T t dt 

dT t = V2dB\ - F(T t Xt)dt - H * f t (T t ,V t )dt, (1.3) 

(T y ) = {xi ia f t = iaw(r t ,r t ). 

Here the Brownian motions (Bl) t > are those governing the evolution of the (X'l, V7) t > . 
Note that the above set of equations involves the condition that f t is the distribution of 
(X\, V]), thus making it nonlinear. The processes (X], V])t>o with i > 1 are independent 
since the initial conditions and driving Brownian motions are independent. Moreover they 
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are identically distributed and, by the Ito formula, their common law f t at time t should 
evolve according to 

d t ft + v ■ V x f t = A v f t + V„ ■ ((F + H * f t ) f t ), t > 0, x, v G R d . (1.4) 

Here a • b denotes the scalar product of two vectors a and b in R d and * stands for the 
convolution with respect to (x, v) G R 2d : 

H * f(x) = / H(x — y,v — w) f(y,w) dy dw. 

Moreover, V z stands for the gradient with respect to the position variable x G R d whereas 
V v , V„- and A„ respectively stand for the gradient, divergence and Laplace operators 
with respect to the velocity variable v G R d . 

Assuming the well-posedness of the stochastic differential system (11.21) and of the 
nonlinear equation (|1.3I) together with some uniform moment bounds, we will obtain 
our main result on the stochastic mean-field limit. Existence and uniqueness of solutions 
to (11.21) . (11.31) and (11.41) verifying the assumptions of the theorem will also be studied but 
with more restrictive assumptions on F and H that we will comment on below. 

Theorem 1.1. Let f be a Borel probability measure and (Xq, Vq) for 1 < i < N be N 
independent variables with law fo- Assume that the drift F and the antisymmetric kernel 
H, with H(—x, —v) = —H(x, v), satisfy that there exist constants A, L, p > such that 

-(v-w)- (F(x, v) - F(x, w)) < A\v — w\ 2 (1.5) 
\F(x,v) -F(y,v)\ < Limn{\x-y\,l}{l + \v\ p ) (1.6) 

for all x, y, v, w in R d , and analogously for H instead of F. Take T > 0. Furthermore, 
assume that the particle system (11.21) and the processes (11.31) have global solutions on 
[0, T] with initial data (Xq, Vq) such that 

sup{/ \H(x-y,v-w)\ 2 df t (x,v)df t (y,w) + [ (\x\ 2 +e a ^ P )df t (x, v)\ < +oo, (1.7) 
with ft = law(X t , V t ). Then there exists a constant C > such that 

E[\x}-T t \ 2 + \v;-V t \ 2 } (i.8) 

for allO < t<T and N > 1. 

Moreover, if additionally there exists p' > p such that 

sup / e alvl "' df t (x,v) < +oo, (1.9) 
o<i<r,/iR2<i 

then for all < e < 1 there exists a constant C such that 

E[\xi-T t \ 2 + \vi-T t \ 2 ] (l.io) 

for allO < t<T and N > 1. 
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This result classically ensures quantitative estimates on the mean field limit and the 
propagation of chaos. First of all, it ensures that the common law / t of any (by ex- 
changeability) of the particles X\ at time t converges to f t as N goes to infinity, as we 
have 



W 2 2 {tf\ f t ) < E[\X\ - X\\ 2 + \Vj - T t \ 2 } < e(N) 



(1.11) 



Here W2 stands for the Wasserstein distance between two measures ji and v in the set 
T > 2(R 2d ) of Borel probability measures on R 2d with finite moment of order 2 defined by 



W 2 (n,v) 



inf {E \\Z-Z\ 2 }} 

(Z,Z) 



where the infimum runs over all couples of random variables (Z, Z) in R 2d x R 2d with 
Z having law fj, and Z having law v (see [|36ll for instance). Moreover e(N) denotes the 
quantity in the right hand side of (11.81) or (11.101) , depending on which part of Theorem ll.il 
we are using. 

Moreover, it proves a quantitative version of propagation of chaos: for all fixed k, the 
law ft of any (by exchangeability) k particles (XI, V t L ) converges to the tensor product 
ff k as N goes to infinity, according to 



W 2 2 {f, 



1 Jt 



< E 

< kE 



1 — 1 



xt v t k ) 



{x\y 



t ■ 



xt v k t ) 



v\\ 



< ke{N). 



It finally gives the following quantitative result on the convergence of the empirical 
measure ff of the particle system to the distribution f t : if ip is a Lipschitz map on R 2d , 
then 



E 



1 N r 
-J2v(xi,v;)- / cpdf t 



< 2E 



N 



1=1 



pdf t 



< e(N) + 



C_ 
N 



by Theorem [TTI and argument on the independent variables (X t , V t ) based on the law of 
large numbers, see P34ll . 

The argument of Theorem II .11 is classical for globally Lipschitz drifts ff34l l30l . For 
space-homogeneous kinetic models it was extended to non-Lipschitz drifts by means of 
convexity arguments, first in one dimension in dUl, then more generally in any dimension 
in lfT3ll27ll . Here, in our space inhomogeneous setting, sole convexity arguments are hope- 
less, and we will replace them by moment arguments using hypothesis (11.71) . We also refer 
to ||3T1 13 for related problems and biological discussions in space-homogeneous kinetic 
models with globally Lipschitz drifts but nonlinear diffusions. 
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Our proof will be written for p > 0, but one can simplify it with p = 0, by only 
assuming finite moments of order 2 in position and velocity. In this case our proof is the 
classical Sznitman's proof for existence, uniqueness, and mean-field limit for globally 
Lipschitz drifts, written in our kinetic setting and giving the classical decay rate in (11.81) 
as 1/N, compared to (|1.8I) - (1 1.101) . We will discuss further examples related to swarming 
models and extensions in subsection |1.2[ 

Section [3] will be devoted to the proof of existence, uniqueness, and moment propaga- 
tion properties (|1.7I) and (11.91) for solutions to (11.21) . (11.31) and (11.41) . This well-posedness 
results and moment control for solutions will be obtained under more restrictive assump- 
tions that those used in the proof of Theorem [TTTJ 

Theorem 1.2. Assume that the drift F and the kernel H are locally Lipschitz functions 
satisfying that there exist C, L > and < p < 2 such that 

-v-F(x,v) <C(1 + M 2 ) (1.12) 

-(v-w) ■ {F(x,v)-F(x,w)) < L\v -w\ 2 (l + \v\ p +\w\ p ), (1.13) 

\F(x,v) -F(y,v)\ < L\x-y\(l + \v\ p ), (1.14) 

\H(x,v)\<C(l + \v\), (1.15) 

\H(x, v) - H(y,w)\ < L(\x -y\ + \v- w\)(l + \v\ p + \w\ p ), (1.16) 

for all x,v,y,w G R d . Let f be a Borel probability measure on R 2d such that 

[ (\x\ 2 + e aMP ')df {x,v) < +oo. 

for some p' > p. Finally, let (Xq, Vq) for 1 < i < N be N independent variables with 
law f . Then, 

i) There exists a pathwise unique global solution to the SDE (|1.2I) with initial data 

(x i , VS). 

ii) There exists a pathwise unique global solution to the nonlinear SDE (11.31) with 
initial datum (Xq, Vq). 

Hi) There exists a unique global solution to the nonlinear PDE (11.41) with initial datum f^. 

Moreover, for allT > there exists b > such that 

sup / (\x\ 2 + e h ^ P ) df t (x, v) < +oo. 

0<t<T i|R2d 

Concerning the hypotheses on F, let us remark that we could also ask F to satisfy 
similar properties as H in fll.15lHll.161 ), but dl.12t - dl.14ft are slightly weaker. 
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1.2 Examples, extensions and variants 

As discussed above the drift F models exterior or local effects, such as self propulsion, 
friction and confinement. In our motivating examples F(x, v) = in the Cucker-Smale 
model and F(x, v) = ((3\v\ 2 — at)v in the D'Orsogna et al model. On the other hand, H 
models the interaction between individuals at (x, v) and (y, w) in the phase space being 
H(x,v) = a(x)v with a(x) = (1 + |x| 2 ) -7 , 7 > in the Cucker-Smale model and 
H(x,v) = —VU(x) in the D'Orsogna et al model. It is straightforward to check the 
assumptions of Theorems 11.11 and 1 1 . 21 in these two cases. 

Of course, more general relaxation terms towards fixed "cruising speed" are allowed 
in the assumptions of Theorem 11.11 for instance: F(x,v) = ((3(x)\v\ s — a(x))v with 
a, f3 globally Lipschitz bounded away from zero and infinity functions and 5 > 0. Also, 
concerning the interaction kernel we may allow H(x,v) = a(x)\v \ q ~ 2 v with q > 1 for 
a bounded and Lipschitz a in Cucker Smale as introduced in ||23ll . This has the effect of 
changing the equilibration rate towards flocking, see IfTTl |23l for details. However, the 
assumptions on existence and moment control in Theorem [L2] are only verified for q = 2. 
Other more general mechanisms can be included such as the one described in ||26ll . 

1.2.1 Variants on the assumptions 

We first remark two simple extensions of the results in Theorem [TTT] by trading off growth 
control on F and H by moment control of the solutions to (11.21) : 

VI. Theorem 11.11 holds while weakening assumption (11.51 ) on F and H to 

(v-w)- (F(x, v) - F(x, w)) > -A\v - w\ 2 (l + \v\ p + \w\ p ) 
both for F and H. Solutions in this case need to satisfy 

< +00 



sup sup E 

N>1 0<t<T 



on the particle system or equivalent conditions on p' for the second estimate (11.101) . 
Observe by lower continuity and weak convergence in N of the law of (JQ 1 , V^) to 
the law ft of (X t , v\) that this is a stronger assumption than part of the assumption 
(11.71) made in Theorem ll.il more precisely 



sup E 

0<t<T 



e bW\\ p 



< +OO 



Observe also that we may not have global existence in this case since for instance 
F(x,v) = —v 3 on R 2 , which leads to blow up in finite time, satisfies this new 
condition with p = 2. 
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V2. One can remove the antisymmetry assumption on H in Theorem ll.ll by imposing 

\H(x, v) — H(x, w)\ < A \v — w\ 

instead of (11.51) for H. The reader can check that very little modifications are needed 
at the only point in the proof below where the symmetry of H is used, namely, when 
bounding term I 2 \. Actually, one can directly carry out the estimates instead of 
symmetrizing the term first. From the modeling point of view, it is important to in- 
clude the non- antisymmetric case since some more refined swarming IBMs include 
the so-called "cone of vision" or "interaction region". In these models, individuals 
cannot interact with all the others but rather to a restricted set of individuals they 
actually see or feel, see ll26l[T2l [Ti. From the mathematical point of view this im- 
plies that the interaction term H * f t need not always be a convolution but must be 
replaced by 

H[ft]{x,v)= H(x,v;y,w)df t (y,w); 

here H(x,v;-,-) is compactly supported in a region that depends on the value of 
(x, v) and H(y, w; x, v) is not necessarily equal to —H(x, v; y, w). Our results ex- 
tend to this case. 

V3. Theorem 11.11 also holds when F is an exterior drift in position only, non globally 
Lipschitz, for instance satisfying 

\F{x) - F(y)\ < A\x -y\(l+ \x\ q + \y\ q ) 

with q > 0. Now, the moment control condition (11.71) has to be reinforced by as- 



suming 

\b\*}\ q 



sup E 

0<t<T 



e b\x l t \* 



< +oo and sup sup E 

N>1 0<t<T 



e 



< +oo 



on the particle system or equivalent conditions on p' for the second estimate (11.101) . 
Observe again by weak convergence in N of the law of (X^, V^) to the law f t of 
(X t , v\) that the latter new moment control assumption is stronger. 

1.2.2 Extensions to nonlinearly dependent diffusion coefficient 

Some researchers have recently argued that the diffusion coefficient at a given point x 
may depend on the neighbours of the point to be considered [J20l |37]|. More precisely, 
they can depend on local in space averaged quantities of the swarm, such as the averaged 
local density or velocity. The averaged local density at x in the particle system (X\, V t l ) 
for 1 < i < N is defined as 



1 N 
-5>(s-X/); 
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from which its corresponding continuous version is 

rj £ (x - y)df t (y,w). 



! R 2d 

Here r) e (x) = jzV^^j where rj is a nonnegative radial nonincreasing function with unit 
integral but non necessarily compactly supported and e measures the size of the interac- 
tion. The name of "local average" comes from the smearing of choosing r] £ instead of a 
Dirac delta at 0, which would be meaningless in the setting of a particle system. Such a 
diffusion coefficient is considered in ll20ll with r]{x) = 1+ ^.p an d £ — h from the point of 
view of the long-time behaviour of solutions to the kinetic equation, not of the mean-field 
limit: there the particle system evolves according to the diffusive Cucker-Smale model 

' dX\ = Vjdt 



dv; 



\ 



N N 

- <n - xi) dB\ <xi - x\\vi - vi)dt, i<t<x 



N ^— ' v 1 N 

3=1 3=1 



witha(x) = j^p. 

Other local quantities upon which the diffusion coefficient may depend on is the aver- 
aged local velocity at x defined as 

1 - 

3=± 

in the particle system, and 

wt) £ (x- y)df t (y,w) 



l R 2d 

in the continuous setting. More generically, we can consider diffusion coefficients in the 
particle system such as 

9[i-,y]h(Vi) n Jx-X')\. (1.17) 



(vtm 

\ 3=1 



Here, rj e controls which individuals we should take into account in the average and with 
which strength; among these X- 7 , how each velocity influences at x is controlled by h; 
finally, after averaging over those j, g controls how we should compute the diffusion 
coefficient. 

For instance, given the diffusion coefficient g(u(x)), we could argue that g should be 
large for small u(x) (large noise for small velocity), and conversely; there we see g as an 
even function, nonincreasing on R + . Similar coefficients were used in ll37l of the form 
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with t)(x) = Zt\ x \<\ and N(x) = \x — X 3 t \ < e}. However, we cannot include this 
scaling in the mean-field setting. 

On the other hand, mean-field limits such as those in Theorem [Tj] have been obtained 
in [[301 [341 with the diffusion coefficient 

1 N 

where a is a 2d x 2d matrix with globally Lipschitz coefficients. 

We include the two variants above by considering diffusion coefficients of the form 



n 



where, for a probability measure / on R 2d , a[z; /] is a 2d x 2d matrix with coefficients 

<?ki[z; f] = g{ I a M (z,z')df(z') 

V/ip2d ' 



R 2d 
2d 



in the notation z = (x, v), z' = (x', v') G R . We shall assume that g is globally Lipschitz 
on R and 

\cr k i(z,z') - a M (z,z')\ 
< C(min{|a; - x\ + \x' - x'\, 1} + \v - v\ + \v' - v'\)(l + \v\ q + |t>'| 9 + \v\ q + \v'\ g .) 

In this notation, ll34l corresponds to g{x) = x and a k i bounded and Lipschitz, and (11.171) 

to aki(z, z') = h(v') rj £ (x — x 1 ) where the kernel i] £ is bounded and Lipschitz and 

\h{v) - h{v')\ < C\v - v'\(l + \v\ q + \v'\ q ). 

Observe that this framework does not include the model considered in ||20ll for which the 
diffusion coefficient is given by a non locally Lipschitz g. 

In this notation and assumption, if furthermore there exists b > such that 



sup E 

0<t<T 



< +oo and sup sup E 

N>1 0<t<T 



< +oo 



on the nonlinear process and particle system, then (11.81) holds in Theorem ll.il and corre- 
spondingly with p' for the second estimate (11.10b (see Remark [2Tb . 



1.2.3 One-variable formulation 

We now give a formulation of the mean-field limit in one variable z G R D , to be thought of 
as z = (x, v) G R 2d as in our examples above or as z = v G R d in a space-homogeneous 
setting. We consider the particle system 

1 N 

dZ\ = adB\ - F(Zt) dt - ^J2 H ( Z i " dt > 1 <^< N 

i=i 
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where a is a (for instance) constant D x D matrix, the {B\) t > Q are N independent stan- 
dard Brownian motions on R D and the initial data Z'q are independent and identically 
distributed. We also consider the nonlinear processes (Z t ) t > defined by 

'dzl = odB\ - F{t t )dt -H* ft(zl)dt, 
% = Zl / t = law(Z<). 

Assume now that there exists C > such that 

(z - z') ■ (F(z) - F{z')) > -C\z - z'\ 2 (l + \z\ p + \z'\ p ) 

for all z, z' E R D with p > 0. Assume also global existence and uniqueness of these 
processes, with 

< +oo 



sup sup E 

Af>l 0<t<T 



and 



sup ( / e b ^ P df t (z) + / \H(z - z')\ 2 df t (z) df t (z')\ < +oo, 

)<t<T Urd J R 2D J 



0<t<T 

or equivalent conditions on p'. Then (|1.8I) and (11.101) in Theorem 11.11 holds . 

2 Mean-field limit: proof 

This section is devoted to the proof of Theorem 11.11 We follow the coupling method 
ir3"4ll3~0~ll3~5l . Given T > 0, we will use C to denote diverse constants depending on T, the 
functions F and H, and moments of the solution f t on [0, T], but not on the number of 
particles N. 

Proof of Theorem \L1\ Let us define the fluctuations as x\ := X\ — X t , v\ := V t l — V], 
i = 1, . . . , N. For notational convenience, we shall drop the time dependence in the 
stochastic processes. As the Brownian motions (B l t ) t >Q considered in (|1.2I) and (11.31) are 
equal, for all i = 1, . . . , N, we deduce 

dx { =v { dt, (2.1) 
dv'= - (F(X i ,V i ) - F{X\T))dt 
1 N 

'nYI (h(X'-X\ V* - V j ) - (H * f t )(T,V)) dt. (2.2) 
i=i 

Let us consider the quantity a(t) = E [\x z \ 2 + |t>*| 2 ] (independent of the label i by sym- 
metry), which bounds the distance W^if^, ft) as remarked in (11.111) . Then, by using 
(l27Tb-(l2T2h. we readily get 

ljE[\xf]=E[x*-v*]<\a(t) (2.3) 
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and 



1 d 



~-E [H 2 ] = -E \v* ■ {F(X\ V*) - F(T,V)) 



1 



' N 

£V ■ {h^-x^v'-v^ - h * f t (T ,T)) 

.3=1 



--■h + h. (2.4) 



Step 1.- Estimate I\ by moment bounds: We decompose 1\ in (12.41) as 



/i = -E 



{F{X\V i )-F{X\V i )) 



E 



■ (F(X\V) - F(X\V 1 )) 



By assumption dl.5h - dl.6h on F, I\ can be controlled by 

Ji<AE [M 2 ] +LE [|^| min{|^|, 1} (1 + \T\ P ) 



Given i? > 0, the second term I 12 is estimated according to 



J 12 < E[|u*| + E 



|V l |<R 



^(i + ^eO^h^i] + ^E[|^| 2 ] +^e 



v'\ min{|x*|,l} \T\ 
1 



+ E 



l r|>K Hmin{M,l}|rr 



< (1 + i?)a(t) + i ( E 



1/2 



E 



\V'\>R 



\T\>R 



1/2 



2p 



by the Young and the Cauchy-Schwarz inequalities. Invoking the Markov inequality, hy- 
pothesis (1 1 .7b implies that there exists C > such that 



E 



iv;i>A 



< e"^ E 



<Ce- aRP 



(2.5) 



for all i and < t < T. By defining r = aR p /2, we conclude that given T > 0, there 
exists C > such that 

h < C{l+r)a{t) + Ce~ r (2.6) 
holds for all r > and all < t < T. 

Step 2.- Estimate I 2 by moment bounds: We decompose the second term in (|2.4h as 



1 

TV' 



:E 



[H(X i -X j ,V i -V j ) - H(X l -X\V l -V 3 



--E\v*-[H(0,0)-(H*f t )(X X 



(2.7) 



N 



E 



N 



J2 v l ■ [H (T - X 3 , v 1 - V 3 ) - (H * f t ) (x l , T 

-'■ hi + -^22 + -^23- 
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Since all particles are equally distributed and H is antisymmetric, we rewrite hi as 



1 N 

hi = "2^5 E E [V - ' [HilC-Xt, V*-V') - H(t-X\V-V r 



Analogously to the argument used to bound I\ in the first step, for each we introduce 
the intermediate term H (X 1 — X 3 1 , V* — V 3 ) , split the expres sion in two terms , and estimate 
the corresponding expectations using dl.5b - dl.6b on H by 



hi < 



AE[\ 



v — v 



il 2 l 



] + L E \v*- v j \ min{|^ - ar>'|, 1} (1 + |F - r j 



(2.8) 



For a given _R > 0, and fixed consider the event 1Z := {\Vi\ < R, \Vj\ < R} 

and the random variable Z := \v l — tP\ min{|x l — 1} (1 + \V — V 3 \ p ). Then the 
last expectation in (12.81) can be estimated as follows, using again the Young and Cauchy- 
Schwarz inequalities: 

E \Z\ = E [t n Z] + E [t n cZ] 

< (1 + 2 P R P ) E [\v* - v°\ \x l - x j \] + ^E [\v< - v j \ 2 } 



+ E 



l n c {1 + \V - V 1 \ p ) 2 



< 2(1 + 2 P R P ) a(t) + 2a(t) + (E [l^c]) 172 ( E 



< 2(2 + 2 p R p ) a(t) + C(E 
<C(l + R p )a(t) + Ce- aRP/2 



1— i 

\V \>R 



+ E 



1 



|W|>/? 



\V - V 

1/2 



1/2 



1 + E 



|y*|4p 



1/2 



(2.9) 



by hypothesis (1 1 .7b . Inserting (12.91 ) into (12.81 ) and defining r = aR p /2, we conclude that 
given T > 0, there exists C > such that 



J 21 < C(l + r)a(t) + CV 



(2.10) 



holds for all r > and all < t < T. 

We now turn to estimate I 2 2, i.e., the second term in (12.71) . Using that H(0, 0) = 0, we 

get 



1 



^<^W 2 ]) 1/2 ( E \(H*f t )(T,v 



N 



The latter inequality follows from 



1/2 C 



E 



\(H*f t )(T,V)\ 



R 4d 



\H(x -y,v- w)\ 2 df t (x,v) df t (y,w), 



(2.11) 



(2.12) 



which is bounded on [0, T] due to hypothesis (1 1 .71 ). 
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The last term J 2 3 is treated as in the classical case in [34, Page 175] by a law of large 
numbers argument. We include here some details for the sake of the reader. By symmetry 
we assume that i = 1. We start by applying the Cauchy-Schwarz inequality to obtain 



1 



' 23 <^(E[KI 2 ]) 1/2 U 



N 



N 
j=2 



1/2 



where Y j := H(X -X\ V -V 3 ) - (H * f t )(X , V ) for j > 2. Note that, for j + k, 



E [Y j -Y k ]=E E Y j ■ Y k \(x\ V U 



E 



E^'KX 1 ,^)] -E[Y k \(X l ,V L )\ 



1 ^-tIn 



by independence of the N processes (X J t , V J t ) t >o, where 



E 



1 ttIn 



Y ] \(X ,V 



[H{X\ - y, V\ -w)-(H* f t )(xl V])\ f t (y, w) dydw = 



since (X J t , V 3 t ) has probability distribution f t . Hence, 

= (n- i)e [|r 2 | 2 ] 

<(N-1) [ \H(x-y,v-w)\ 2 df t (x,v)df t (y,w)<C(N-l) 



E 



N 
J'=2 



as in (12.121) due to hypothesis (11.71) . Therefore, we get 

C 



h 3 < 



(2.13) 



Hence, combining the estimates (12.101) , (12.1 II) and (12.131 ) to estimate I 2 , we get that there 
exists C > such that 



C 



h < C(l + r)a(t) + Ce- r + ^ yfitf) 

VN 



(2.14) 



holds for all r > and all < t < T. 



Step 3.- Proof of (fL8l) : It follows from (IO) . (l2~4l) . (12^61) . (12.141) . the above estimates 
and the Young inequality that 

a'(t) < C (1 + r) a(t) + Ce~ r + —yffif) < C (1 + r) a{t) + Ce~ r + — 

for all t G [0, T], all N > 1 and all r > 0. From this differential inequality and Gronwall's 
lemma, we can first deduce that the quantity a(t) is bounded on [0, T], uniformly in N, 
by a constant D > 0. Hence, the function /3(t) := a(t)/(eD) is bounded by 1/e, so that 
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1 — ln/3 < — 2 ln/3. Now, whenever (3{t) > 0, take r : = — In > 0. This choice proves 
that, for any t such that (5{t) > 0, 



P'(t) < C(l - In /3(f)) /3(t) + ^ < -C/3(t) ln/3(t) + ^. 



(2.15) 



Actually, the above inequality is also true whenever f3[t) = (with the convention that 
z log z = for 2 = 0), as can be seen by choosing r := log iV in that case. Hence, (12.151) 
holds for all t E [0, T). Now, the function u(t) := 0(Ct) satisfies u(0) = and 

u < — u m^l^ 

N 

on [0, T/C]. Let finally a{t) be a function on [0,T/C], to be chosen later on. Then the 

map v(t) = u(t) 7V a W satisfies v (0) = and 



v' < -vhiv + A^ a_1 + u lnA^(a + a) < -vlnv + 1 < 



1 



on [0, T/C] provided we choose a(t) = e * < 1. Hence, this choice of a(t) implies the 
bound 

for < t < T/C, that is, 



r(t) <(Vl 



E[|^ - V t f + \T t - V\\ 2 ] = a{t) < CN-*- Ct 

for < t < T, and thus, (fL8l) is proven. 

Step 4.- Proof of (11.1 Oh / If additionally there exists p' > p such that hypothesis (11.91) 
holds, then by the Markov inequality, estimate (12.51) turns into 



E 



1 



\vl\>R 



< e 



-aRP 



E 



Mvl\p 



< Ce 



-aRP 



Hence, following the same proof, the quantity a(t) finally satisfies the differential in- 
equality 

oc'{t) < C(l + r) a{t) + C e- rP ' /P + ^ 

for all iV > 1 and all r > 0. If we choose r = (lniV) p / p ', and since a(0) = 0, this 
integrates to 

a f t ) < 2 ( e C[l+r)t _ 1 \ < l e C(l+r)T = 2 & CT ^(InN)^' T-lnN _ 

K ' ~ N(l + r) \ J ~ N 

Given e > 0, there exists a constant D such that 



C(ln N) p/p 'T - InN < D - (1 - e) IniV 



for all N > 1, so that 



a(t)<2e CT+D N-^ 
for all > 1. This concludes the proof of Theorem ll.il 



□ 



18 



Francois Bolley, Jose A. Cahizo, Jose A. Carrillo 



Remark 2.1. In the setting of s ub section 1 1 . 2 . 21 when the constant diffusion coefficient \/2 
governing the evolution of the particle {X\ , VI) is replaced by a more general a [X l t , Vf; f t ] , 
then, by the Ito formula, we have to control the extra term 

J2 K; f t N ] - a kl [T t y u ft]\ 2 ] ■ 

k,l 

For that purpose we use the Lipschitz property of g, introduce the intermediate term 

i Ji .... 

— 2_^o r ki{(X t , V t ), (X t , V t )) and adapt the argument used above to bound the term 

i=i 

h. 



3 Existence and uniqueness 

This section is devoted to the proof of Theorem 1 1.21 on existence, uniqueness and propa- 
gation of moments for solutions of the particle system (11.21) , the nonlinear process (11.31) 
and the associated PDE (11.41) . This provides a setting under which Theorem 11.11 holds, 
showing that the existence and moment bound hypotheses are satisfied under reasonable 
conditions on the coefficients of the equations and the initial data alone. 

Given T > 0, we will denote by b and C constants, that may change from line to line, 
depending on T, the functions F and H, and moments of the initial datum f . 



3.1 Existence and uniqueness of the particle system 

Let us start by proving point i) of Theorem 1 1.21 In this section we let f G 7^2 (lR 2d ) an d 
consider the particle system for 1 < i < N: 

dXf = V?dt 

i N n n 

dvi = V2dBi - F(xi, v;)dt - j^J2 H W - x l> v t - W 

i=i 

with initial data (Xq, Vq) for 1 < i < N distributed according to fo. Here the (B\) t >Q, for 
i — 1, . . . , N, are N independent standard Brownian motions on \R d . 

Lemma 3.1. Let f e 7^2 (R 2d )» and assume that F, H are locally Lipschitz and satisfy 
(11.121) and (11.151) . For 1 < i < N, take random variables (Xq, Vq) with law fa. Then (13.11) 
admits a pathwise unique global solution with initial datum (Xq, V^for 1 < i < N. 

Proof. The system (13.11) can be written as the SDE 

dZf = a N dB? + b(Zf)cft 
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in R 2dN , where Zf = {X}, V t \ . . . , X? , V t N ). Here cx^ is a constant 2diV x 2diV matrix, 
(Bf ) t > is a standard Brownian motion on R 2d7V , and b : R 2d7V -> R 2dN is a locally 
Lipschitz function defined in the obvious way. Moreover, letting (•, •) be the scalar product 
and || • || the Euclidean norm on R 2dN , then for all Z N = (X 1 , V 1 ,..., X N , V N ), 

N N N 

(Z N , h(Z N )) = J2 xi - yl -J2 Vi ' F ( Xi ' V ^ - M yi ' H i xi ~ xj , V l -V 3 ) 

8=1 1=1 *;i = l 



1 1 N 

<(C+ -)(N + \\Z N \\ 2 ) +C-J2 1*1(1 + \V* - V*|) 



< C(iV + llZ^f) + ^ ^(1 + \Vf + \V j \ 2 ) < C(N + \\Z N \\ 2 ). 

Here we have used the elementary inequality 2ab < a 2 + b 2 , and the bounds (11.121) and 
(11.151 ). This is a sufficient condition for global existence and pathwise uniqueness, see I12TI 
Chapter 5, Theorems 3.7 and 3.11] for instance. □ 

Remark 3.2. For the existence we do not use any properties of symmetry of the system, 
and in particular we do not need the initial data to be independent. On the other hand, the 
condition (11.151) in Lemma I3TI can be relaxed to 

-v ■ H{x,v) < C(l + M 2 ), 

if we impose that H is antisymmetric, i.e., H(—x, —v) = —H(x, v) for all x,v £ R d . Ac- 
tually, in this case we can perform a symmetrization in to estimate the term involving 
if by 

N N 

-— yi • H(X*-X j ,V*-V s ) = -— J2( Vi ~ Vj ) •H{X i -X j ,V i -V s ) 

i,j=l i,j=l 

N 

- + \ yi - Vj \ 2 ) <C(^+||Z N || 2 ). 

3.2 Existence and uniqueness for the nonlinear process and PDE 

In this section we prove points ii) and Hi) in Theorem 11.21 namely, the existence and 
uniqueness of solutions to the nonlinear SDE (11.31) : 

dX t = V t dt 

dV t = V2dB t - F(X t ,V t )dt - H * f t (X t ,V t )dt, 
ft = law(X t ,F t ), law(X ,F ) = /„. 
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and to the associated nonlinear PDE (11.41) : 

dtft + v ■ V x f t = AJ t + V V -((F + H* f t ) f t ) , t > 0, x,v e R d , 

under the hypotheses of Theorem 11.21 Notice that we drop the superscript i for the SDE 
(11.31) . as the problem is solved independently for each i. For the PDE (|1.4I) . we always 
consider solutions in the sense of distributions: 

Definition 3.3. Assume that F, H : R 2d — > R 2d are continuous, and that (11.151) holds. 
Given T > 0, a function / : [0,T] -> V 2 (R 2d ), continuous in the W 2 topology, is a 
solution of equation (Q~4]) with initial data f G V 2 {R 2d ) if for all if e Cg°([0, T) x R 2d ) it 
holds that 

/ ( Podf = - / (9 S (^ S + A v (p s - V v (p s - (F + H*f s ) + V x (p s -v) df s ds. (3.2) 

JlR 2d Jo jR 2d 



Notice that all terms above make sense due to the continuity of F, H, equation (|1.15l) . and 
the bound on the second moment of ft on bounded time intervals (needed for H * f t to 
make sense). For the purpose of this definition, condition (|1.15l) can actually be relaxed 
to \H(x, v) \ < C(l + \x\ 2 + \v\ 2 ), but we will always work under the stronger hypothesis 
below. 

The proof spans several steps, that we split in several subsections. Since some parts 
of the proof hold under weaker conditions on F and H, we will specify the hypotheses 
needed in each part. 

3.2.1 Existence and uniqueness of an associated linear SDE 

Let us first consider a related linear problem: we want to solve the SDE 



dX t = V t dt 

dV t = V2dB t - F(X t , V t )dt - (H * gt){X t , V t )dt 



(3.3) 



for given F, H : R 2d ->■ R d and g : [0, T] V 2 (R 



Lenuna 3.4. Assume that F and H are locally Lipschitz functions satisfying (11.121) . 
(11.151) . and (11.161) . Lef f e V 2 (R 2d ) and g : [0,T] -> V 2 (R 2d ) be a continuous curve 
in the W 2 topology. Then the equation (13.31 ) with inital datum (X , Vq) distributed ac- 
cording to f has a global pathwise unique solution. Moreover this solution has bounded 
second moment on [0, T]. 



Proof. We rewrite (13.31 ) as 

dZ t = adB t + b(t, Z t ) dt 
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on R 2d , where Z t = (X t , V t ). Here a is a 2d x 2d matrix, (B t )t>o is a standard Brownian 
motion on R 2d and 

b{t, x, v) = (v, -F(x, v) - (H * gr t )(a;, u)). 
Let us first observe that the growth condition (11.151 ) on H implies 

\(H *g t )(x,v)\ <C I (l + \v-w\)dg t (y,w) <C (l + \v\ + [ \w\dg t (y,w)) . 

jR 2d \ jR 2d / 

(3.4) 

This together with the Cauchy-Schwarz inequality results in 



v{H*g t ){x,v)\<C il + \v\ 2 + / \w\ 2 dg t (y,w) 



(3.5) 



Estimate (13.41) ensures that, for fixed x, v E R d , the map t h-> (i? * ^^(a;, f ) is bounded 
on [0,T], using the fact that the second moment of g t (and hence its first moment) is 
uniformly bounded on [0, T). Then, the same applies tot ^ b(t, x, v). 

Also, the map (x, v ) M- (iif * u) is locally Lipschitz, uniformly on t G [0, T]; 

indeed, for x, v,y,w G R d , using (11.161) we get 

| (# * ^t) (ar, v) - (ff * s- t ) (y, iw) | 

< f H(x — X,v — V) - H(y — X,w — V) dg t (X,V) 

JR 2d 

[ (1 + \v- V\ p + \w- V\ p )dg t (X,V) 

jR 2d 

1 + \v\ p + \w\ p + [ \V\ p dg t (X,V) . 

The moment of g t above is bounded on [0, T] since p < 2 and the curve is continuous in 
the WVmetric. As F is also locally Lipschitz, we conclude that t x, v) is locally 

Lipschitz. 

Finally, for the scalar product (•, •) on R 2d , we deduce 
((x, v), b(t, x, v )) = x ■ v — v ■ F(x, v) — v ■ [H * g t )(x, v) < C[ 1 + \x\ 2 + \ v \ 2 ) , (3.6) 



<L( 


X 


-y 


+ 


v — w 


<C( 


X 


- y 


+ 


v — w 



by (11.121 ), (13.51 ), and again the fact that the second moment of g t is uniformly bounded on 
[0, T]. As in the proof of Lemma [3T1 these are sufficient conditions for global existence 
and pathwise uniqueness for solutions to (13.41) with square-integrable initial data. 
Moreover, by (13.61 ), 

^E[|X t | 2 + \V t \ 2 } = 2d + 2 E ({X t , V t ), b{t, X u V t )) < 2d + CE[l + \X t \ 2 + \V t \ 2 ] , 



so that by integration the second moment E[|X t | 2 + \V t \ 2 } is bounded on [0, T]. 



□ 
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3.2.2 Existence and uniqueness of an associated linear PDE 

By Ito's formula, the law f t of the solution of (13.31 ) at time t is a solution of the following 
linear PDE: 

dtft + v ■ V x f t = Avft + V v -(f t (F + H* g t )) , (3.7) 

in the distributional sense as in (13.21) of Definition 13.31 Moreover, the curve t >->■ f t is 
continuous for the W 2 topology. Indeed, on the one hand 

Wi(f t , f s ) < E [\x t - x s \ 2 + \v t - v s \ 2 } . 

On the other hand, the paths t >-)■ X t (co) are continuous in time for a.e. to, and (X t , V t ) 
has bounded second moment on [0, T); hence, by the Lebesgue continuity theorem, for 
fixed s the map 1 1-> E [\X t — X s \ 2 + \ V t — V s \ 2 ] is continuous, and hence converges to 
as t tends to s. (Alternatively, one can obtain quantitative bounds on the time continuity 
by estimating E [\X t — X s \ 2 + \ V t — V s \ 2 ] in the spirit of the last equation in the proof of 
Lemma l34t we do not follow this approach here). 

Then one can follow a duality argument in order to show that solutions to (13.71) are 
unique, which we sketch now. Take a solution f t of (13.71) with f = 0; we wish to show 
that f t = for any t > 0. For fixed t > and if smooth with compact support in R 2d , 
consider the solution h t defined for t G [0, t ] of the dual problem 

d t h t + v ■ V x h t = -A v h t + (F + H*g t )- V v h t , 
K = V- 

This is a linear final value problem, and by considering h tQ -t one can show that it has a 
solution by classical arguments. In addition, for each t, this solution h t is a continuous 
function, as can be seen through classical results on propagation of regularity. Then, as h t 
solves the dual equation of (|3.7I) . it holds that 

J t jft h t = ° (*e(0,*o)), 

from which J f to <p = J /o^o = 0. Since ip is arbitrary, this shows that f to = and proves 
the uniqueness. 

3.2.3 Existence and uniqueness for the nonlinear PDE and SDE 

We are now ready to finish proving points ii) and in) in Theorem [L2l 

Step 1.- Iterative scheme: Take f° e V2(R 2d ) and random variables V°) with law 
/o, and let (B t ) t > be a given standard Brownian motion on R d . We define the stochastic 
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processes (X™, V t n ) t >o recursively by 

dX? = V t n dt 

dV t n = V2dB t - F(X» V t n ) dt-(H* /r 1 ) W, K n ) dt, 
W, V") = (X°, F°) 

for n > 1, where /" := law(X", K t n ) and it is understood that / t ° := /° for all t > 0. 
Observe that these are linear SDEs for which existence and pathwise uniqueness are given 
by Lemma l3~4l since all t is continuous for the W 2 topology. We also know from 

section |3~.2.2| that the /" are weak solutions to the PDE 

dtlt + v ■ VJ? = AJ? + V v ■ (f?(F + H * ft- 1 )). 

with initial condition f . More precisely, the following holds for n > 1 and all ip E 

C °°([0,T) x R 2d ): 

[ ip df = - [ [ {dsips+Avips-Vvips-iF+Hxf^+Vxtps^dffds. (3.8) 

jR 2d JO JR 2d 

Step 2.- Uniform estimates on moments of We shall prove the following lemma: 

Lemma 3.5. Assume Hypothesis dl.12b - dl.16b on F and H. Let f be a probability mea- 
sure on R 2 such that 

[ (\x\ 2 + e a ^ P )df (x,v) < +oo 

JR 2d 

for a positive a. Then for all T there exists a positive constant b such that, for the laws / t n 
of the processes (X t n , V t n ), 

sup sup / (|a;| 2 + e bHP ) df?(x, v) < +oo. 

n>l 0<t<T jR 2d 

Proof. We prove this lemma in two steps. 
Step I.- Bound for moments of order 2: Let 

e n (t)= [ \v\ 2 df?(x,v) 

jR 2d 

for n > 1 and t > 0. Using (11.1 2b and (13.51) applied to the measure ft \ we get 

<(t)=T+[ \v\ 2 df? = 2d-2 [ v ■ ((F + H * f^) df?(x, v) 
at j R 2d j$2d 

<2d + 2C [ (l + \v\ 2 + [ \w\ 2 dfr\y,w)) df?(x,v) 

jR 2d V JR 2d / 

<C(l + e n (t) + e n ^(t)) , 
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for diverse constants C depending on F and H but not on t or n. Since moreover 

e n (0) = e (t) = e (0) = / \v\ 2 df (x, v), 

jR 2d 

for all t and n, then one can prove by induction that 

sup / \ v \^<( D+ l -)e^- 1 - 
where -D = max{C, eo(0)}. Moreover the bound 



^/ \x\ 2 df?(x,v)=2 I x-vdf?(x,v)< I \x\ 2 df?(x,v) + e n (t) 

R 2d J R 2d J R 2d 



dt 



ensures that also / \x\ 2 df™(x, v) is bounded on [0, T], uniformly in n. 

jR 2d 

Step 2.- Bound for exponential moments: Let a = a(t) be a smooth positive function to 
be chosen later on and (v) = (1 + l^ 2 ) 1 / 2 . Then we have the following a priori estimate: 



d_ 

It 



e a{t){v)P df?(x,v)= / [dap{v) p - 2 + ap(p-2)\v\ 2 (v} p - 4 + a 2 p 2 \v\ 2 {vy 

R 2d J R 2d 



2 (v} p -' + a 2 p 2 \v\ 2 (v} 2p - i 
+a' (v} p - ap (v) p - 2 v(F + H* f?- l )]e a( > v)V df?(x, v). 

But, by (11.121) . (13.51) applied to / f n_ , and the bound on the moment of order 2 in Step 1, 

—v-(F + H* ft~ l ) <C[1 + \v\ 2 + [ Iwfdff-^y^w) j < C (v) 2 

V JR 2d J 



uniformly on n > 1 and t G [0, T], so 

e a{t){v)P df?{x, v) 



d 
~dt 



R 2d 

< I [Ca {v} p - 2 + C« 2 ( V ) 2p - 2 + a'(t) (v) p + Ca (v) p ] e a{v)P df t n (x, v) 



l R 2d 

< [ [Ca + Ca 2 + a'](v) p e a{v)P df t n (x,v). 

jR 2d 



since p < 2. Choosing a such that Ca + Ca 2 + a' < (for instance, a(t) = Me 2Ct , 
with < M < 1) we conclude that 



dt 



e a{t){v)P df? < 

R 2d 



Hence, we obtain 



e a ® {v)P df?(x,v) < / e amv)P df£(x,v) = e a{0){v)P df (x,v), 

R 2d JR 2d JR 2d 
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which is finite provided a(0) < a, which can be satisfied by taking M = min{a, 1} 
above. Then, taking b = a(T) we conclude that 

sup sup / e^df? < oo. (3.9) 

n>0 0<t<T JU 2d 

□ 

We notice for later use that as a direct consequence of (13.91) using the Markov inequal- 
ity, there exists C > such that 

sup sup E [1\v"\>r] < e~ bRP sup sup E [ e b ^^ P ] < Ce~ bRP . (3.10) 

0<t<T n>0 * 0<t<T n>0 

Step 3.- Existence for the nonlinear PDE. We intend to carry out an argument analo- 
gous to the one for the existence and uniqueness of solutions for the 2D Euler equation in 
fluid mechanics, found for example in ||28l . We will prove that the / t n converge to a limit, 
and that this limit is a solution to the nonlinear PDE. To simplify notation we drop time 
subscripts and use the following shortcuts: 

v n ._ yn+l _ yn ? % n ._ _ ;= yn^ ^ . = ^ y y 

Also, we write 

: = E [|a; n | 2 ] + E [\v n \ 2 ] = E [\Z n+1 - Z n \ 2 ] . 
We compute, by Ito's formula, and for any n > 1, 

j t E [\x n \ 2 ] = 2 E [x n ■ v n ] < E [\x n \ 2 ] + E [K| 2 ] = 7 n (t) , (3.11) 

with 

' -E [\v n \ 2 ] = - E [v 11 ■ (F(Z n+l ) - F(Z n ))] 
- E 



2dt 



■ ((h * m(z n+i ) - (h * /r 1 )^"))] == ^ + r 2 . 0.12) 

Estimate for T\. We decompose the term 7\ as 

Ti = - E [(V n+l -V n ) ■ (F{X n+1 ,V n+1 ) -F{X n+1 ,V n ))] 
- E [(V n+1 -V n ) ■ (F(X n+1 , V n ) - F(X n , V n ))] , 

which by (ll.13h - dl.14b is bounded above by 

Ti < L E [K| 2 (l + I^T + \V n+1 \ p )] + LE [|w n | |a; n | (1 + |y n | p )] 
=:T n +T 12 . 
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Given R > 0, we bound Tn as follows: 



T n < L(l + 2R P ) E [\v n \ 2 ] + L E [K| 2 |^T%"|>;?] + LE [\v n \ 2 \V n+1 \ p l lvn+ i ]>R ] 
<L(1 + 2RP) 1 n {t) + LE [\v n \ 4 \V n \ 2p ] 1/2 E [t\ Vn>R ] 1/2 

+ LE[K| 4 |^ +1 | 2 p] 1/2 E[V. +1 | >fl ] 1/2 
< C (1 + iF) 7 n W + C E [l|vn |>fl ] 1/2 + C E [l|vn + i|>ji] 1/2 , 



where we have used the uniform-in- n bound on moments of f n obtained in (13 .9b ■ For the 
term T 12 , we get 

T 12 < L(l + E [\v n \\x n \] + L E [Kl |x"| KTV»|>*] 

< - (1 + JP) 7 »(f) + £ E [K| 2 ] + - E [K| 2 M^V^h] 



2 

< L (1 + i?)^*) + I E [Kl 4 m 4 "] 1/2 E 
<L(l + ^) 7 "(t) + CE[Vn |>R ] 



1/2 



1/2 



using again the bound on moments of f n in (13.91) . Finally, using (13.101) . there exist con- 
stants b and C such that for all < t < T 



T x <C(l + R p )-f n (t) + Ce 



-bRP 



(3.13) 



for all n and R > 0. 

Estimate for T 2 . On the other hand, for T 2 , 



T 2 =-E 



(H*fn(z 



■E 



H*(ft-ft)(Z 



=:T 21 +T 22 . (3.14) 
For the first term T 21 , we proceed analogously to the estimates of T u and T 12 to obtain 



T 2 i = — E 
<LE 



r ^ / {H(Z 

R 2d 



n+1 



z) - H(Z n - z))f r t \x } v) dx dv 

l _ Z n \(l + |^ n | P + + \v\ P )f?(x, 

<CE [\v n \\Z n+1 - Z n \ (1+ \V n \ p + \V n+1 \ p )] 
<C(l + R p )-f n {t) + Ce- bRP 



v) dx dv 



(3.15) 



where the last steps where not detailed since they are very similar to the estimates of Tn 
and Tu, and the uniform moment bounds (13.91 ) and (11.161) were used. Now, for T 22 , ob- 
serve that, taking g n : = law((X n , V n , V n ~ 1 )), we can write the following identity 

A:= H * (/ t n - f t n - 1 )(Z n )= [ (H(X n - x, V n - v) - H(X n - y, V n - w)) dg n 
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where we used the shortcut notation dg n for the measure dg n (x, v, y, w). By the Cauchy- 
Schwarz inequality and the uniform moment bounds (13.91) . we get 



\A\<L / (\x-y\ + \v-w\)(l + \V n \ p + \v\ p + \w\ p )dg n (x,v,y,w) 
<L{l + \V n \ p )E[\x n - l \ + \v n - l \] 

+ l([ (\x-y\ + \v-w\) 2 dgA ( f (\v\* + \w\*) 2 dA 

<C(1+ \V n \ p )^ [Ix"" 1 ! + Iv^l] +C(E [\Z n - Z n - l \ 2 ]) 1/2 

<c(i + \v n \ p )j n ~\t) 1/2 ■ 

Using the expression of T 22 , we deduce 

T 22 < E [\v n \\H * {f? - fr'KZ^l] < C7"'~W /2 E [K|(l + |y n | p )] 

< Cy*" 1 ^) + E [K| 2 ] E [\V n \ 2p ] < C-f n -\t) + C-f n (t) , (3.16) 

again by the Cauchy-Schwarz inequality and the uniform moment bounds (13.91) . 
Hence, putting (l3~l3l) . (1515b . (|3l5T) and (I3l6l) in (I31TT) and (I3l2l) . we conclude 

^7 n W < C(l + R p h n (t) + Cj n -\t) + Ce~ bRP . 

Induction Argument. Taking R > 1 we may write that 

j t l n {t) < C ( n n (t) + r~\t) + e- r ) , (3.17) 
for some other constant C > and for any all r > 1. Gronwall's Lemma then proves that 

l n (t) <C [ e Cr( ^ s) 7 n - 1 (s) ds + Ce- r te Cr *, 
Jo 

and iterating this inequality gives 



7 n (t) < C n f e Cr ^ s h°(s) ( * S) \' ds + Cte- r e CH V 

(n-1)! ^ 



J.71— 1 

<C n e Cr '-- — / 1 °(s)ds + Cte- r e Crt e ct 

[n-iy. Jo 

< C n e Crt t n sup 7°(s) + Cte^e* *-** . 



se[o,t] 

Taking r = n, we obtain 



l n (t) < exp (n(\n(Ct) + Ct)) sup 7 °(s) + Cte ct e n{ct ~ l) . 

se[o,t] 
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Choosing < T* < T small enough such that A := max{CT, - 1, ln(CT») + CT*} < 0, 
then 

sup 7 n (t) < e An [ sup 7 °(s) + CTe CT J . (3.18) 

te[0,T«] \«6[0,T»] y 

Since by definition W 2 (f? +1 ,f?) 2 < E|Z™ +1 - Z™| 2 = 7 n (t), we conclude that the 
sequence of curves {t G [0,2*] >->■ f r t l } n >o is a Cauchy sequence in the metric space 
C([0, 2 1 *], P 2 (R 2d )) equipped with the distance 

m(f,g) ■= sup W 2 (/ t ,<fc). 

te[o,T,] 

By completeness of this space, we define / G C([0, T*], P 2 (IR 2<i )) by / 4 := lim n _^ +00 /" 
fort G [0,2;]. 

This convergence and the uniform moment bounds on f n in (13.91 ) allow us to pass to 
the limit in (13. 8b . Let us point out how to deal with the nonlinear term in the equation: 
observe first that for fixed s, it is given by 

/ V v ip s -H*f:~ l df:= [ V v <p s (x,v)-H(x-y,v-w)df:- 1 (y,w)df?(x,v). 

(3.19) 

But on the one hand and /" converge to f s for the Wi topology, hence so does 
fs' 1 ® fs t0 fs ® fs (in R 4d ). On the other hand 

\V v ip s (x,v) ■ H(x -y,v -w)\ < ||V„^ a ||ioc(l + \v\ + \w\) 

by (11.151) . Therefore (13.191) converges to 



V v (p s (x,v) ■ H(x - y,v - w)df s (y,w)df s (x, v) = / V v (f s ■ H * f s df s 

for all s. Uniform-in-s bounds finally allow to pass to the limit in the integral in s. 

With this, we have shown that f t is a solution on [0, 2*] of the nonlinear PDE (|1.4I) . 
Now, one can extend the solution to the whole interval [0, T] by iterating this procedure, 
starting from 2*. This can be done since the additional time T[ for which we can extend 
a solution starting at T* depends only on moment bounds on fx, , for which we have the 
bound (f3T9b . valid up to T. 

Step 4.- Existence for the nonlinear SDE: Now we use f t to define the process (X t , V t ) 
by 

dX t = V t dt 

dV t = V2dB t - F(X t , V t ) dt-(H* f t )(X t , V t ) dt, (3.20) 
(X ,V ) = (X°,V°) 



29 



thanks to Lemma 13.41 Observe that for all t, f t is the WVlimit of /f and p < 2, so that 

/ \v\Pdf t (x,v) = [ \v\*df?(x,v)<C, 

uniformly in t G [0, T] since the / t n have second moments bounded according to Lemma l331 
If g t is the law of (X t , V t ), then, as in section |3^2.2l g t is a weak solution of the linear PDE 

9t9t + v ■ V x g t = A v g t + V V ((F + H* f t )g t ). 

Of course, f t is also a solution of the same linear PDE; by uniqueness of solutions to 
this linear PDE (see again section 13X21) . we deduce that f t = g t , and hence (X t , V t ) is a 
solution to the nonlinear SDE (Ob on [0, T] . 

Step 5.- Uniqueness for the nonlinear PDE (11.41 ): Now, take two solutions f 1 , f 2 of 
the nonlinear PDE, and define the processes (X}, V^) and (Xf, V t 2 ) by (13.201) . putting 
f 1 and f 2 in the place of /, respectively. As the law of (X\, V t l ) (i = 1, 2) solves the 
linear PDE (13.71) with f\ instead of g t , so this law must in fact be fl by uniqueness of the 
linear PDE. Then, if we follow the same calculation we did in step 3 above, we obtain the 
following instead of (13.171) : 

j t l(t)<C n (t) + Ce- r , (3.21) 

for a constant C and any r > 1, with 7 (t) := E [\X} - X 2 \ 2 } + E [l^ 1 - V 2 \ 2 ]. For the 
above to be valid, we need a bound on exponential moments of f} and f 2 . This estimate 
can be obtained in a similar way as in Lemma [331 and therefore, we omit the proof: 

Lemma 3.6. Assume hypotheses dl.12b - dl.16b on F and H. Let (ft)t>o be a solution to 
(1 1 .4-1) with initial datum a probability measure f on R 2d such that 

[ (\x\ 2 + e alvlP )f (x,v)dxdv < +oo 

for a positive a. Then for all T there exists b > which depends only on fo and T, such 
that 

sup / (jx\ 2 + e b ^ P ) f t (x,v) dx dv < +oc. 

0<t<T J Rid 

Observe now that 

7(0) = E[|X 1 -X 2 | 2 +|^ 1 -K) 2 | 2 ] =0 
since X$ = X% = X°, and similarly for V. 
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Assume now that 7 is non identically 0. Then, with the same argument as in step 3 of 
the proof of Theorem ll.il whenever < 7 (t) < 1/e we can choose r := — ln 7 (t) in 
(13.211) to obtain 

Ly( t ) < - Cl {t) ln 7 (t) + C 7 (t) < -2C7 7 (f) ln 7 (t). (3.22) 
at 

If 7 (t) = at some point, then one can see that 4 7 (t) < by letting r — > +00 in (13.211) . 
and in that case the inequality (13.221) holds trivially (again setting zlnz = 0atz = 
by continuity). Hence, (|3.22l) holds as long as < 7 (t) < 1/e. By Gronwall's Lemma, 
this implies that j(t) = for t G [0, T] and shows that f} and /| coincide, proving that 
solutions to the nonlinear PDE are unique. 

Step 6.- Uniqueness for the nonlinear SDE (11.31) : Take two pairs of stochastic pro- 
cesses pfj 1 , Vl) and (X t 2 , V t 2 ) which are solutions to the nonlinear SDE (11.31) . Then, their 
laws are solutions to the nonlinear PDE (11.41) . and by the previous step we know that they 
must be the same. If we call f t their common law, then both (X^, V^) and (X 2 , V t 2 ) are 
solutions to the linear SDE (13.31 ) with f t instead of g t , and by uniqueness of this linear 
SDE (see section 13". 2. II) . they must coincide. 

This concludes the proof of Theorem II .2[ 
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